SEQUENCE LISTING 



<110> DREYFUS , MARC 
LOPEZ , PASCAL 

<120> MUTANT E. COLI STRAINS , AND THEIR USE FOR PRODUCING 
RECOMBINANT POLYPEPTIDES 

<13 0> USB 9 8 APCNREC 

<140> 09/762,481 
<141> 2001-07-02 

<150> PCT/FR99/01879 
<151> 1999-07-29 

<150> FR 98/10197 
<151> 1998-08-07 

<160> 6 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 3661 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (441) . . (3623) 
<400> 1 

gaaaaaactg tgagtaagcg ggtgataaat ggtaaaagtc atcttgctat aacaaggctt 60 

gcagtggaat aatgaggccg tttccgtgtc catccttgtt aaaacaagaa attttacgga 12 0 

ataacccatt ttgcccgacc gatcatccac gcagcaatgg cgtaagacgt attgatcttt 18 0 

caggcagtta gcgggctgcg ggttgcagtc cttaccggta gatggaaata tttctggaga 24 0 

gtaataccca gtctgtttct tgtataattg cgctgttttt ccgcatgaaa aacgggcaac 3 00 

cgacactctg cgcctctttg agctgacgat aaccgtgagg ttggcgacgc gactagacac 3 60 

gaggccatcg gttcacaccc ggaaaggcgt tactttgccc gcagcttagt cgtcaatgta 42 0 

agaataatga gtaagttacg atg aaa aga atg tta ate aac gca act cag cag 473 

Met Lys Arg Met Leu lie Asn Ala Thr Gin Gin 
15 10 

gaa gag ttg cgc gtt gec ctt gta gat ggg cag cgt ctg tat gac ctg 521 
Glu Glu Leu Arg Val Ala Leu Val Asp Gly Gin Arg Leu Tyr Asp Leu 
15 20 25 



gat ate gaa agt cca ggg cac gag cag aaa aag gca aac ate tac aaa 
Asp lie Glu Ser Pro Gly His Glu Gin Lys Lys Ala Asn lie Tyr Lys 
30 35 40 



569 



2 



ggt aaa ate acc cgc att gaa ccg agt ctg gaa get get ttt gtt gat 617 
Gly Lys lie Thr Arg lie Glu Pro Ser Leu Glu Ala Ala Phe Val Asp 
45 50 55 



tac ggc get gaa cgt cac ggt ttc etc cca eta aaa gaa att gcc cgc 
Tyr Gly Ala Glu Arg His Gly Phe Leu Pro Leu Lys Glu He Ala Arg 



60 65 70 



75 



gat tac tta cgt cag gac ate ggc gaa ate ctt ate gat aac ccg aaa 
Asp Tyr Leu Arg Gin Asp He Gly Glu He Leu He Asp Asn Pro Lys 
220 225 230 



235 



665 



713 



gaa tat ttc cct get aac tac agt get cat ggt cgt ccc aac att aaa 
Glu Tyr Phe Pro Ala Asn Tyr Ser Ala His Gly Arg Pro Asn He Lys 
80 85 so 

gat gtg ttg cgt gaa ggt cag gaa gtc att gtt cag ate gat aaa gaa 761 
Asp Val Leu Arg Glu Gly Gin Glu Val He Val Gin He Asp Lys Glu 
95 100 105 

gag cgc ggc aac aaa ggc gcg gca tta acc acc ttt ate agt ctg gcg 8 09 
Glu Arg Gly Asn Lys Gly Ala Ala Leu Thr Thr Phe He Ser Leu Ala 
110 115 120 

ggt age tat ctg gtt ctg atg ccg aac aac ccg cgc gcg ggt ggc att 8 57 
Gly Ser Tyr Leu Val Leu Met Pro Asn Asn Pro Arg Ala Gly Gly He 
125 130 135 

tct cgc cgt ate gaa ggc gac gac cgt acc gaa tta aaa gaa gca ctg 905 
Ser Arg Arg He Glu Gly Asp Asp Arg Thr Glu Leu Lys Glu Ala Leu 
140 145. iso 155 

gca age ctt gaa ctg ccg gaa ggc atg ggg ctt ate gtg cgc acc get 953 
Ala Ser Leu Glu Leu Pro Glu Gly Met Gly Leu He Val Arg Thr Ala 
160 165 170 

ggc gtc ggc aaa tct get gag gcg ctg caa tgg gat tta age ttc cgt 1001 
Gly Val Gly Lys Ser Ala Glu Ala Leu Gin Trp Asp Leu Ser Phe Arg 
x75 18 0 * 185 

ctg aaa cac tgg gaa gcc ate aaa aaa gcc get gaa age cgc ccg gcc 1049 
Leu Lys His Trp Glu Ala He Lys Lys Ala Ala Glu Ser Arg Pro Ala 
190 195 200 

ccg ttc ctg. att cat cag gag age aac gta ate gtt cgc gca ttc cgc 10 97 
Pro Phe Leu He His Gin Glu Ser Asn Val He Val Arg Ala Phe Arg 
205 210 215 



1145 



gtg etc gaa ctg gca cgt cag cat ate get gca tta ggt cgc ccg gat 1193 , 

Val Leu Glu Leu Ala Arg Gin His He Ala Ala Leu Gly Arg Pro Asp 

240 245 " 250 

ttc age age aaa ate aaa ctg tac acc ggc gag ate ccg ctg ttc age 1241 

Phe Ser Ser Lys He Lys Leu Tyr Thr Gly Glu He Pro Leu Phe Ser 

255 260 265 



3 



cac tac cag ate gag tea cag ate gag tec gee ttc cag cgt gaa gtt 128 9 
His Tyr Gin lie Glu Ser Gin lie Glu Ser Ala Phe Gin Arg Glu Val 
270 275 280 

cgt ctg ccg tct ggt ggt tec att gtt ate gac age ace gaa gcg tta 1337 
Arg Leu Pro Ser Gly Gly Ser lie Val lie Asp Ser Thr Glu Ala Leu 

. 285 290 . 295 

acg gee ate gac ate aac tec gca cgc gcg ace cgc ggc ggc gat ate 13 85 
Thr Ala lie Asp lie Asn Ser Ala Arg Ala Thr Arg Gly Gly Asp lie 
300 305 310 315 

gaa gaa ace gcg ttt aac act aac etc gaa get gee gat gag att get 143 3 
Glu Glu Thr Ala Phe Asn Thr Asn Leu Glu Ala Ala Asp Glu lie Ala 
320 325 330 

cgt cag ctg cgc ctg cgt gac etc ggc ggc ctg att gtt ate gac ttc 1481 
Arg Gin Leu Arg Leu Arg Asp Leu Gly Gly Leu lie Val lie Asp Phe 
335 340 345 

ate gac atg acg cca gta cgc cac cag cgt gcg gta gaa aac cgt ctg 1529 
He Asp Met Thr Pro Val Arg His Gin Arg Ala Val Glu Asn Arg Leu 
350 355 360 

cgt gaa gcg gtg cgt cag gac cgt gcg cgt att caa ate age cat att 1577 
Arg Glu Ala Val Arg Gin Asp Arg Ala Arg He Gin He Ser His He 
365 370 375 

tct cgc ttt ggc ctg ctg gaa atg tec cgt cag cgc ctg age cca tea 162 5 
Ser Arg Phe Gly Leu Leu Glu Met Ser Arg Gin Arg Leu Ser Pro Ser 
380 385 390 395 

ctg ggt gaa tec agt cat cac gtt tgt ccg cgt tgt tct ggt act ggc 1673 
Leu Gly Glu Ser Ser His His Val Cys Pro Arg Cys Ser Gly Thr Gly 
400 405 410 

ace gtg cgt gac aac gaa teg ctg teg etc tct att ctg cgt ctg ate 1721 
Thr Val Arg Asp Asn Glu Ser Leu Ser Leu Ser He Leu Arg Leu lie 
415 420 425 

gaa gaa gaa gcg ctg aaa gag aac ace cag gaa gtt cac gee att gtt 17 69 
Glu Glu Glu Ala Leu Lys Glu Asn Thr Gin Glu Val His Ala He Val 
430 435 440 

cct gtg cca ate get tct tac ctg ctg aat gaa aaa cgt tct gcg gta 1817 
Pro Val Pro He Ala Ser Tyr Leu Leu Asn Glu Lys Arg Ser Ala Val 
445 450 455 

aat gee att gaa act cgt cag gac ggt gtg cgc tgt gta att gtg cca 1865 
Asn Ala He Glu Thr- Arg Gin Asp Gly Val Arg Cys Val He Val Pro 
460 465 470 475 

aac gat cag atg gaa ace ccg cac tac cac gtg ctg cgc gtg cgt aaa 1913 
Asn Asp Gin Met Glu Thr Pro His Tyr His Val Leu Arg Val Arg Lys 
480 485 490 
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ggg gaa gaa acc cca acc tta age tac atg ctg ccg aag ctg cat gaa 1961 
Gly Glu Glu Thr Pro Thr Leu Ser Tyr Met Leu Pro Lys Leu His Glu 
495 500 505 

gaa gcg atg gcg ctg ccg tct gaa gaa gag ttc get gaa cgt aag cgt 2 009 
Glu Ala Met Ala Leu Pro Ser Glu Glu Glu Phe Ala Glu Arg Lys Arg 
510 515 520 

ccg gaa caa cct gcg ctg gca acc ttt gec atg ccg gat gtg ccg cct 2057 
Pro Glu Gin Pro Ala Leu Ala Thr Phe Ala Met Pro Asp Val Pro Pro 
525 530 535 

gcg cca acg cca get gaa cct gee gcg cct gtt gta get cca gca ccg 2105 
Ala Pro Thr Pro Ala Glu Pro Ala Ala Pro Val Val Ala Pro Ala Pro 
540 545 550 555 

aaa get gca ccg gca aca cca gca get cct gca caa cct ggg ctg ttg 2153 
Lys Ala Ala Pro Ala Thr Pro Ala Ala Pro Ala Gin Pro Gly Leu Leu 
560 565 570 

age cgc ttc ttc ggc gca ctg aaa gcg ctg ttc age ggt ggt gaa gaa 22 01 
Ser Arg Phe Phe Gly Ala Leu Lys Ala Leu Phe Ser Gly Gly Glu Glu 
575 580 585 

acc aaa ccg acc gag caa cca gca ccg aaa gca gaa gcg aaa ccg gaa 2249 
Thr Lys Pro Thr Glu Gin Pro Ala Pro Lys Ala Glu Ala Lys Pro Glu 
590 595 600 

cgt caa cag gat cgt cgc aag cct cgt cag aac aac cgc cgt gac cgt 2297 
Arg Gin Gin Asp Arg Arg Lys Pro Arg Gin Asn Asn Arg Arg Asp Arg 
605 610 615 

aat gag cgc cgc gac acc cgt agt gaa cgt act gaa ggc age gat aat 2345 
Asn Glu Arg Arg Asp Thr Arg Ser Glu Arg Thr Glu Gly Ser Asp Asn 
620 625 630 635 

cgc gaa gaa aac cgt cgt aat cgt cgc cag gca cag cag cag act gec 23 93 
Arg Glu Glu Asn Arg Arg Asn Arg Arg Gin Ala Gin Gin Gin Thr Ala 
640 645 650 

gag acg cgt gag age cgt cag cag get gag gta acg gaa aaa gcg cgt 2441 
Glu Thr Arg Glu Ser Arg Gin Gin Ala Glu Val Thr Glu Lys Ala Arg 
655 660 665 

_.acc gec gac gag cag caa gcg ccg cgt cgt gaa cgt age cgc cgc cgt 2489 
Thr Ala Asp Glu Gin Gin Ala Pro Arg Arg Glu Arg Ser Arg Arg Arg 
670 675 680 

aat gat gat aaa cgt cag gcg caa caa gaa gcg aag gcg ctg aat gtt 2537 
Asn Asp Asp Lys Arg Gin Ala Gin Gin Glu Ala Lys Ala Leu Asn Val 
685 690 695 

gaa gag caa tct gtt cag gaa acc gaa cag gaa gaa cgt gta cgt ccg 2 5 85 
Glu Glu Gin Ser Val Gin Glu Thr Glu Gin Glu Glu Arg Val Arg Pro 
700 705 710 715 
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gtt cag ccg cgt cgt aaa cag cgt cag etc aat cag aaa gtg cgt tac 2633 
Val Gin Pro Arg Arg Lys Gin Arg Gin Leu Asn Gin Lys Val Arg Tyr 
720 725 ^ 730 

gag caa age gta gec gaa gaa gcg gta gtc gca ccg gtg gtt gaa gaa 2 681 
Glu Gin Ser Val Ala Glu Glu Ala Val Val Ala Pro Val Val Glu Glu 
735 740 745 

act gtc get gec gaa cca att gtt cag gaa gcg cca get cca cgc- aca 2729 
Thr Val Ala Ala Glu Pro He Val Gin Glu Ala Pro Ala Pro Arg Thr 
750 755 760 

gaa ctg gtg aaa gtc ccg ctg cca gtc gta gcg caa act gca cca gaa 2777 
Glu Leu Val Lys Val Pro Leu Pro Val Val Ala Gin Thr Ala Pro Glu 
765 770 775 

cag caa gaa gag aac aat get gat aac cgt gac aac ggt ggc atg ccg 2 82 5 
Gin Gin Glu Glu Asn Asn Ala Asp Asn Arg Asp Asn Gly Gly Met Pro 
780 785 790 795 

cgt cgt tct cgc cgc teg cct cgt cac ctg cgc gta agt ggt cag cgt 2 873 
Arg Arg Ser Arg Arg Ser Pro Arg His Leu Arg Val Ser Gly Gin Arg 
800 805 810 

cgt cgt cgc tat cgt gac gag cgt tat cca ace cag teg cca atg ccg 2921 
Arg Arg Arg Tyr Arg Asp Glu Arg Tyr Pro Thr Gin Ser Pro Met Pro 
815 820 825 

ttg acc gta gcg tgc gcg tct ccg gaa ctg gee tct ggc aaa gtc tgg 2969. 
Leu Thr Val Ala Cys Ala Ser Pro Glu Leu Ala Ser Gly Lys Val Trp 
830 835 840 

ate cgc tat cca att gta cgt ccg caa gat gta cag gtt gaa gag cag 3 017 
He Arg Tyr Pro He Val Arg Pro Gin Asp Val Gin Val Glu Glu Gin 
845 850 855 

cgc gaa cag gaa gaa gta cat gtg cag ccg atg gtg act gag gtc cct 3 065 
Arg Glu Gin Glu Glu Val His Val Gin Pro Met Val Thr Glu Val Pro 
860 865 870 875 

gtc gec gec get ate gaa ccg gtt gtt age gcg cca gtt gtt gaa gaa 3113 
Val Ala Ala Ala He Glu Pro Val Val Ser Ala Pro Val Val Glu Glu 
880 885 890 

gtg gec ggt gtc gta gaa gec ccc gtt cag gtt gee gaa ccg caa ccg 3161 
Val Ala Gly Val Val Glu Ala Pro Val Gin Val Ala Glu Pro Gin Pro 
895 900 905 

gaa gtg gtt gaa acg acg cat cct gaa gtg ate get gee gcg gta act 3209 
Glu Val Val Glu Thr Thr His Pro Glu Val He Ala Ala Ala Val Thr 
910 915 920 

gaa cag ccg cag gtg att acc gag tct gat gtt gee gta gee cag gaa 3257 
Glu Gin Pro Gin Val He Thr Glu Ser Asp Val Ala Val Ala Gin Glu 
925 930 935 



gtt gcagaa caa gca gaa ccg gtg gtt gaa ccg cag gaa gag acg gca 3305 
Val Ala Glu Gin Ala Glu Pro Val Val Glu Pro Gin Glu Glu Thr Ala 
940 945 950 955 

gat att gaa gaa gtt gtc gaa act get gag gtt gta gtt get gaa cct 3 353 
Asp He Glu Glu Val Val Glu Thr Ala Glu Val Val Val Ala Glu Pro 
960 965 970 

gaa gtt gtt get caa cct gee gcg cca gta gtc get gaa gtc gca gca 34 01 
Glu Val Val Ala Gin Pro Ala Ala Pro Val Val Ala Glu Val Ala Ala 
975 980 985 

gaa gtt gaa acg gta get gcg gtc gaa cct gag gtc acc gtt gag cat 3449 
Glu Val Glu Thr Val Ala Ala Val Glu Pro Glu Val Thr Val Glu His 
990 995 1000 

aac cac get acc gcg cca atg acg cgc get cca gca ccg gaa tat gtt 34 97 
Asn His Ala Thr Ala Pro Met Thr Arg Ala Pro Ala Pro Glu Tyr Val 
1005 1010 ~ 1015 

ccg gag gca ccg cgt cac agt gac tgg cag cgc cct act ttt gec ttc 3 545 
Pro Glu Ala Pro Arg His Ser Asp Trp Gin Arg Pro Thr Phe Ala Phe 
1020 1025 1030 1035 

gaa ggt aaa ggt gee gca ggt ggt cat acg gca aca cat cat gec tct 3593 
Glu Gly Lys Gly Ala Ala Gly Gly His Thr Ala Thr His His Ala Ser 
1040 1045 1050 

gec get cct gcg cgt ccg caa cct gtt gag taataattag ctcaaagtaa 3643 
Ala Ala Pro Ala Arg Pro Gin Pro Val Glu 
1055 1060 

tcaagccctg gtaactgc 3661 

<210> 2 
<211> 1061 
<212> PRT 

<213> Escherichia coli 
<400> 2 

Met Lys Arg Met Leu He Asn Ala Thr Gin Gin Glu Glu Leu Arg Val 
1 5 10 15 

Ala Leu Val Asp Gly Gin Arg Leu Tyr Asp Leu Asp He Glu Ser Pro 
20 25 *" 30 

Gly His Glu Gin Lys Lys Ala Asn He Tyr Lys Gly Lys He Thr Arg 
35 40 ' 45 

He Glu Pro Ser Leu Glu Ala Ala Phe Val Asp Tyr Gly Ala Glu Arg 
50 55 60 

His Gly Phe Leu Pro Leu Lys Glu He Ala Arg Glu Tyr Phe Pro Ala 
65 70 75 ~ - 80 

Asn Tyr Ser Ala His Gly Arg Pro Asn He Lys Asp Val Leu Arg Glu 
85 90 ~ 95 
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Gly Gin Glu Val He Val Gin He Asp Lys Glu Glu Arg Gly Asn Lys 
100 105 no 

Gly Ala Ala Leu Thr Thr Phe He Ser Leu Ala Gly Ser Tyr Leu Val 
115 120 125 

Leu Met Pro Asn Asn Pro Arg Ala Gly Gly He Ser Arg Arg He Glu 
130 135 . 140 

Gly Asp Asp Arg Thr Glu Leu Lys Glu Ala Leu Ala Ser Leu Glu Leu 
145 150 155 ' 160 

Pro Glu Gly Met Gly Leu He Val Arg Thr Ala Gly Val Gly. Lys Ser 
165 170 175 

Ala Glu Ala Leu Gin Trp Asp Leu Ser Phe Arg Leu Lys His Trp Glu 
180 185 iso 

Ala He Lys Lys Ala Ala Glu Ser Arg Pro Ala Pro Phe Leu He His 
195 200 205 

Gin Glu Ser Asn Val He Val Arg Ala Phe Arg Asp Tyr Leu Arg Gin 
210 215 220 

Asp He Gly Glu He Leu He Asp Asn Pro Lys Val Leu Glu Leu Ala 
225 230 235 240 

Arg Gin His He Ala Ala Leu Gly Arg Pro Asp Phe Ser Ser Lys He 
245 250 255 

Lys Leu Tyr Thr Gly Glu He Pro Leu Phe Ser His Tyr Gin He Glu 
260 265 270 

Ser Gin He Glu Ser Ala Phe Gin Arg Glu Val Arg Leu Pro Ser Gly 
275 280 285 

Gly Ser He Val He Asp Ser Thr Glu Ala Leu Thr Ala He Asp He 
2 ^0 295 300 

Asn Ser Ala Arg Ala Thr Arg Gly Gly Asp He Glu Glu Thr Ala Phe 
305 310 315 320 

Asn Thr Asn Leu Glu Ala Ala Asp Glu He Ala Arg Gin Leu Arg Leu 
325 330 335 

Arg Asp Leu Gly Gly Leu He Val He Asp Phe He Asp Met Thr Pro 
340 345 350 

Val Arg His Gin Arg Ala Val Glu Asn Arg Leu Arg Glu Ala Val Arg 
355 360 365 

Gin Asp Arg Ala Arg He Gin He Ser His He Ser Arg Phe Gly Leu 
370 375 380 

Leu Glu Met Ser Arg Gin Arg Leu Ser Pro Ser Leu Gly Glu Ser Ser 
385 390 395 ' 400 
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His His Val Cys Pro Arg Cys Ser Gly Thr Gly Thr Val Arg Asp Asn 
405 410 415 

Glu Ser Leu Ser Leu Ser lie Leu Arg Leu lie Glu Glu Glu Ala Leu 
420 425 430 

Lys Glu Asn Thr Gin Glu Val His Ala He Val Pro Val Pro He Ala 
435 440 445 

Ser Tyr Leu Leu Asn Glu Lys Arg Ser Ala Val Asn Ala He Glu Thr 
450 455 460 

Arg Gin Asp Gly Val Arg Cys Val He Val Pro Asn Asp Gin Met Glu 
465 470 475 480 

Thr Pro His Tyr His Val Leu Arg Val Arg Lys Gly Glu Glu Thr Pro 
485 490 495 

Thr Leu Ser Tyr Met Leu Pro Lys Leu His Glu Glu Ala Met Ala Leu 
500 505 . 510 

Pro Ser Glu Glu Glu Phe Ala Glu Arg Lys Arg Pro Glu Gin Pro Ala 
515 520 ~ 525 

Leu Ala Thr Phe. Ala Met Pro Asp Val Pro Pro Ala Pro Thr Pro Ala 
530 535 540 

Glu Pro Ala Ala Pro Val Val Ala Pro Ala Pro Lys Ala Ala Pro Ala 
545 550 555 560 

Thr Pro Ala Ala Pro Ala Gin Pro Gly Leu Leu Ser Arg Phe Phe Gly 
565 570 ~ 575 

Ala Leu Lys Ala Leu Phe Ser Gly Gly Glu Glu Thr Lys Pro Thr Glu 
580 585 590 

Gin Pro Ala Pro Lys Ala Glu Ala Lys Pro Glu Arg Gin Gin Asp Arg 
595 600 605 

Arg Lys Pro Arg Gin Asn Asn Arg Arg Asp Arg Asn Glu Arg Arg Asp 
610 615 "* 620 

Thr Arg Ser Glu Arg Thr Glu Gly Ser Asp Asn Arg Glu Glu Asn Arg 
625 630 635 640 

Arg Asn Arg Arg Gin Ala Gin Gin Gin Thr Ala Glu Thr Arg Glu Ser 
645 650 ~ 655 

Arg Gin Gin Ala Glu Val Thr Glu Lys Ala Arg Thr Ala Asp Glu Gin 
660 665 ~ 670 

Gin Ala Pro Arg Arg Glu Arg Ser Arg Arg Arg Asn Asp Asp Lys Arg 
675 680 _ 685 

Gin Ala Gin Gin Glu Ala Lys Ala Leu Asn Val Glu- Glu Gin Ser Val 
690 695 ' 700 



Gin Glu Thr Glu Gin Glu Glu Arg Val Arg Pro Val Gin Pro Arg Arg 
705 710 ~ 715 720 

Lys Gin Arg Gin Leu Asn Gin Lys Val Arg Tyr Glu Gin Ser Val Ala 
725 730 735 

Glu Glu Ala Val Val Ala Pro Val Val Glu Glu Thr Val Ala Ala Glu 
740 745 750 

Pro lie Val Gin Glu Ala Pro Ala Pro Arg Thr Glu Leu Val Lys Val 
755 760 765 

Pro Leu Pro Val Val Ala Gin Thr Ala Pro Glu Gin Gin Glu Glu Asn 
770 775 780 

Asn Ala Asp Asn Arg Asp Asn Gly Gly Met Pro Arg Arg Ser Arg Arg 
785 790 795 " 800 

Ser Pro Arg His Leu Arg Val Ser Gly Gin Arg Arg Arg Arg Tyr Arg 
805 810 ~ 815 

Asp Glu Arg Tyr Pro Thr Gin Ser Pro Met Pro Leu Thr Val Ala Cys 
820 825 830 

Ala Ser Pro Glu Leu Ala Ser Gly Lys Val Trp lie Arg Tyr Pro lie 
835 840 845 

Val Arg Pro Gin Asp Val Gin Val Glu Glu Gin Arg Glu Gin Glu Glu 
850 855 860 

Val His Val Gin Pro Met Val Thr Glu Val Pro Val Ala Ala Ala lie 
865 870 875 880 

Glu Pro Val Val Ser Ala Pro Val Val Glu Glu Val Ala Gly Val Val 
885 890 895 

Glu Ala Pro Val Gin Val Ala Glu Pro Gin Pro Glu Val Val Glu Thr 
900 905 910 

Thr His Pro Glu Val He Ala Ala Ala Val Thr Glu Gin Pro Gin Val 
915 920 925 

He Thr Glu Ser Asp Val Ala Val Ala Gin Glu Val Ala Glu Gin Ala 
930 935 940 

Glu Pro Val Val Glu Pro Gin Glu Glu Thr Ala Asp He Glu Glu Val 
945 950 955 960 

Val Glu Thr Ala Glu Val Val Val Ala Glu Pro Glu Val Val Ala Gin 
965 970 975 

Pro Ala Ala Pro Val Val Ala Glu Val Ala Ala Glu Val Glu Thr Val 
980 985 990 

Ala Ala Val Glu Pro Glu Val Thr Val Glu -His Asn His Ala Thr Ala 
995 1000 1005 
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Pro Met Thr Arg Ala Pro Ala Pro Glu Tyr Val Pro Glu Ala Pro Arg 
1010 1015 1020 

His Ser Asp Trp Gin Arg Pro Thr Phe Ala Phe Glu Gly Lys Gly Ala 
1025 1030 1035 " 1040 

Ala Gly Gly His Thr Ala Thr His His Ala Ser Ala Ala Pro Ala Arg 
1045 1050 . 1055 

Pro Gin Pro Val Glu 
1060 



<210> 3 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 3 

gggctgcagt ttccgtgtcc atccttg 



<210> 4 
<211> 26 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 4 

gggagatctt gattactttg agctaa 



<210> 5 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 5 

ctgcagatag cccgcctaat gagcgggctt ttttttctgc ag 



<210> 6 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 6 

gcggtggtta agaaaccaaa c 
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SEQUENCE LISTING 
(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: CNRS 

(B) STREET: 3, rue Michel -Ange 

(C) CITY: PARIS 

(E) COUNTRY: FRANCE 

(F) POSTAL CODE: 75794 CEDEX 16 

(ii) TITLE OF THE INVENTION: Mutant E. coli strains, and 
their use for producing recombinant polypeptides 

(iii) NUMBER OF SEQUENCES: 6 

(iv) COMPUTER- READABLE FORM: 

(A) TYPE OF MEDIUM: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(OEB) 



(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 661 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) CONFIGURATION: linear 

(ii) TYPE OF MOLECULE: DNA (genomic) 

(ix) ADDITIONAL CHARACTERISTICS: 

(A) NAME: CDS 

(B) POSITION: 441.. 3623 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

GAAAAAACTG TGAGTAAGCG GGTGATAAAT GGTAAAAGTC ATCTTGCTAT AACAAGGCTT 
60 



40 



WO 00/08183 
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GCAGTGGAAT AATGAGGCCG TTTCCGTGTC CATCCTTGTT AAAACAAGAA ATTTTACGGA 
120 

ATAACCCATT TTGCCCGACC GATCATCCAC GCAGCAATGG CGTAAGACGT ATTGATCTTT 
180 

CAGGCAGTTA. GCGGGCTGCG GGTTGCAGTC CTTACCGGTA GATGGAAATA TTTCTGGAGA 
240 

GTAATACCCA GTCTGTTTCT TGTATAATTG CGCTGTTTTT CCGCATGAAA AACGGGCAAC 
300 

CGACACTCTG CGCCTCTTTG AGCTGACGAT AACCGTGAGG TTGGCGACGC GACTAGACAC 
360 

GAGGCCATCG GTTCACACCC GGAAAGGCGT TACTTTGCCC GCAGCTTAGT CGTCAATGTA 
42 0 

AGAATAATGA GTAAGTTACG ATG AAA AGA ATG TTA ATC AAC GCA ACT CAG 
470 

Met Lys Arg Met Leu lie Asn Ala Thr Gin 
15 10 



CAG GAA GAG TTG CGC GTT GCC CTT GTA GAT GGG CAG CGT CTG TAT GAC 
518 

Gin Glu Glu Leu Arg Val Ala Leu Val Asp Gly Gin Arg Leu Tyr Asp 

15 20 25 

CTG GAT ATC GAA AGT CCA GGG CAC GAG CAG AAA AAG GCA AAC ATC TAC 

566' 

Leu Asp lie Glu Ser Pro Gly His Glu Gin Lys Lys Ala Asn lie Tyr 
30 35 40 

AAA GGT AAA ATC ACC CGC ATT GAA CCG AGT CTG GAA GCT GCT TTT GTT 
614 

Lys Gly Lys lie Thr Arg lie Glu Pro Ser Leu Glu Ala Ala Phe Val 
45 50 55 

GAT TAC GGC GCT GAA CGT CAC GGT TTC CTC CCA CTA AAA GAA ATT GCC 
662 

Asp Tyr Gly Ala Glu Arg His Gly Phe Leu Pro Leu Lys Glu lie Ala 
60 65 70 



41 
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CGC 


GAA TAT 


TTC 


CCT 


GCT 


AAC TAC AGT GCT 


CAT 


GGT CGT CCC AAC ATT 




710 














Arg 


Glu Tyr 


Phe 


Pro 


Ala Asn Tyr Ser Ala 


His 


Gly Arg Pro Asn lie 


75 








80 




85 


90 


AAA 


GAT GTG 


TTG 


CGT 


GAA 


GGT CAG GAA GTC 


ATT 


GTT CAG ATC GAT AAA 




758 














Lys 


Asp Val 


Leu 


Arg 


Glu Gly Gin Glu Val 


lie 


Val Gin He Asp Lys 








95 




100 




105 


GAA 


GAG CGC 


GGC 


AAC 


AAA 


GGC GCG GCA TTA 


ACC 


ACC TTT ATC AGT CTG 




806 














Glu 


Glu Arg 


Gly 


Asn 


Lys 


Gly Ala Ala Leu 


Thr 


Thr Phe He Ser Leu 






110 






115 




120 


GCG 


GGT AGC 


TAT 


CTG 


GTT 


CTG ATG CCG AAC 


AAC 


CCG CGC GCG GGT GGC 




■ 854 














Ala 


Gly Ser 


Tyr 


Leu 


Val 


Leu Met Pro Asn 


Asn 


Pro Arg Ala Gly Gly 




125 








130 




135 


ATT 


TCT CGC 


CGT 


ATC 


GAA 


GGC GAC GAC CGT 


ACC 


GAA TTA AAA GAA GCA 




902 














He 


Ser Arg 


Arg 


He 


Glu Gly Asp Asp Arg 


Thr 


Glu Leu Lys Glu Ala 




140 








145 




150 


CTG 


GCA AGC 


CTT 


GAA 


CTG 


CCG GAA GGC ATG 


GGG 


CTT ATC GTG CGC ACC 




950 














Leu 


Ala Ser 


Leu 


Glu 


Leu 


Pro Glu Gly Met 


Gly Leu He Val Arg Thr 


155 








160 




165 


170 


GCT 


GGC GTC 


GGC 


AAA 


TCT 


GCT GAG GCG CTG 


CAA 


TGG GAT TTA AGC TTC 




998 














Ala 


Gly Val 


Gly 


Lys 


Ser 


Ala Glu Ala Leu 


Gin 


Trp Asp Leu Ser Phe 








175 




180 




185 


CGT 


CTG AAA 


CAC 


TGG 


GAA 


GCC ATC AAA AAA 


GCC 


GCT GAA AGC CGC CCG 




1046* 














Arg 


Leu Lys 


His 


Trp 


Glu 


Ala He Lys Lys 


Ala 


Ala Glu Ser Arg Pro 






190 






195 




2 00 


GCC 


CCG TTC 


CTG 


ATT 


CAT 


CAG GAG AGC AAC 


GTA 


ATC GTT CGC GCA TTC 




1094 














Ala 


Pro Phe 


Leu 


He 


His 


Gin Glu Ser Asn 


Val 


He Val Arg Ala Phe 



42 



WO 00/08183 PCT/FR99/01879 

■205 210 215 

CGC GAT TAC TTA CGT CAG GAC ATC GGC GAA ATC CTT ATC GAT AAC CCG 
1142 

Arg Asp Tyr Leu Arg Gin Asp He Gly Glu lie Leu He Asp Asn Pro 
220 225 230 



AAA GTG CTC GAA CTG GCA CGT CAG CAT ATC GCT GCA TTA GGT CGC CCG 
1190 

Lys Val Leu Glu Leu Ala Arg Gin His He Ala Ala Leu Gly Arg Pro 
235 240 245 * 250 

GAT TTC AGC AGC AAA ATC AAA CTG TAC ACC GGC GAG ATC CCG CTG TTC 
1238 

Asp Phe Ser Ser Lys He Lys Leu Tyr Thr Gly Glu He Pro Leu Phe 

255 260 ~ 265 

AGC CAC TAC CAG ATC GAG TCA CAG ATC GAG TCC GCC TTC CAG CGT GAA 
1286 

Ser His Tyr Gin He Glu Ser Gin He Glu Ser Ala Phe Gin Arg Glu 
270 275 280 

GTT CGT CTG CCG TCT GGT GGT TCC ATT GTT ATC GAC AGC ACC GAA GCG 
1334 

Val Arg Leu Pro Ser Gly Gly Ser He Val He Asp Ser Thr Glu Ala 
285 290 295 

TTA ACG GCC ATC GAC ATC AAC TCC GCA CGC GCG ACC CGC GGC GGC GAT 
1382 

Leu Thr Ala He Asp lie Asn Ser Ala Arg Ala Thr Arg Gly Gly Asp 
300 305 310. 

ATC GAA GAA ACC GCG TTT AAC ACT AAC CTC GAA GCT GCC GAT GAG ATT 
1430 

He Glu Glu Thr Ala Phe Asn Thr Asn Leu Glu Ala Ala Asp Glu He 
315 320 325 " 330 

GCT CGT CAG CTG CGC CTG CGT GAC CTC GGC GGC CTG ATT GTT ATC GAC 
1478 

Ala Arg Gin Leu Arg Leu Arg Asp Leu Gly Gly Leu He Val He Asp 

335 340 345 
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TTC ATC GAC ATG ACG CCA GTA CGC CAC CAG CGT GCG GTA GAA AAC CGT 
1526 

Phe lie Asp Met Thr Pro Val Arg His Gin Arg Ala Val Glu Asn Arg 
350 355 360 

CTG CGT GAA GCG GTG CGT CAG GAC CGT GCG CGT ATT CAA ATC AGC CAT 
1574 

Leu Arg Glu Ala Val Arg Gin Asp Arg Ala Arg lie Gin lie Ser His 
365 370 375 

ATT TCT CGC TTT GGC CTG CTG GAA ATG TCC CGT CAG CGC CTG AGC CCA 
1622 

lie Ser Arg Phe Gly Leu Leu Glu Met Ser Arg Gin Arg Leu Ser Pro 
380 ' 385 390 

TCA CTG GGT GAA TCC AGT CAT CAC GTT TGT CCG CGT TGT TCT GGT ACT 
1670 

Ser Leu Gly Glu Ser Ser His His Val Cys Pro Arg Cys Ser Gly Thr 
395 400 405 * 410 

GGC ACC GTG CGT GAC AAC GAA TCG CTG TCG CTC TCT ATT CTG CGT CTG 
1718 

Gly Thr Val Arg Asp Asn Glu Ser Leu Ser Leu Ser lie Leu Arg Leu 

415 420 425 

ATC GAA GAA GAA GCG CTG AAA GAG AAC ACC CAG GAA GTT CAC GCC ATT 
1766 

lie Glu Glu Glu Ala Leu Lys Glu Asn Thr Gin Glu Val His Ala lie 
430 435 440 

GTT CCT GTG CCA ATC GCT TCT TAC CTG CTG AAT GAA AAA CGT TCT GCG 
1814 

Val Pro Val Pro lie Ala Ser Tyr Leu Leu Asn Glu Lys Arg Ser Ala 
445 450 455 



GTA AAT GCC ATT GAA ACT CGT CAG GAC GGT GTG CGC TGT GTA ATT GTG 
1862 

Val Asn Ala lie Glu Thr Arg Gin Asp Gly Val Arg Cys Val lie Val 
460 465 470 
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CCA AAC GAT CAG ATG GAA ACC CCG CAC TAC CAC GTG CTG CGC GTG CGT 
1910 

Pro Asn Asp Gin Met Glu Thr Pro His Tyr His Val Leu Arg Val Arg 
475 480 485 490 

AAA GGG GAA GAA ACC CCA ACC TTA AGC TAC ATG CTG CCG AAG CTG CAT 
1958 

Lys Gly Glu Glu Thr Pro Thr Leu Ser Tyr Met Leu Pro Lys Leu His 

495 500 505 

GAA GAA GCG ATG GCG CTG CCG TCT GAA GAA GAG TTC GCT GAA CGT AAG 
2006 

Glu Glu Ala Met Ala Leu Pro Ser Glu Glu Glu Phe Ala Glu Arg Lys 
510 515 520 

CGT CCG GAA GAA CCT GCG CTG GCA ACC TTT GCC ATG CCG GAT GTG CCG 
2054 

Arg Pro Glu Gin Pro Ala Leu Ala Thr Phe Ala Met Pro Asp Val Pro 
525 530 535 

CCT GCG CCA ACG CCA GCT GAA CCT GCC GCG CCT GTT GTA GCT CCA GCA 
2102 

Pro Ala Pro Thr Pro Ala Glu Pro Ala Ala Pro Val Val Ala Pro Ala 
540 545 550 

CCG AAA GCT GCA CCG GCA ACA CCA GCA GCT CCT GCA CAA CCT GGG CTG 
2150 

Pro Lys Ala Ala Pro Ala Thr Pro Ala Ala Pro Ala Gin Pro Gly Leu 
555 560 565 570 

TTG AGC CGC TTC TTC GGC GCA CTG AAA GCG CTG TTC AGC GGT GGT GAA 
2198 

Leu Ser Arg Phe Phe Gly Ala Leu Lys Ala Leu Phe. Ser Gly Gly Glu 

575 580 585 

GAA ACC AAA CCG ACC GAG CAA CCA GCA CCG AAA GCA GAA GCG AAA CCG 
2246 

Glu Thr Lys Pro Thr Glu Gin Pro Ala Pro Lys Ala Glu Ala Lys Pro 
590 595 600 

GAA CGT CAA CAG GAT CGT CGC AAG CCT CGT CAG AAC AAC CGC CGT GAC 
2294 

Glu Arg Gin Gin Asp Arg Arg Lys Pro Arg Gin Asn Asn Arg Arg Asp 
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605 610 615 

CGT AAT GAG CGC CGC GAC ACC CGT AGT GAA CGT ACT GAA GGC AGC GAT 
2342 

Arg Asn Glu Arg Arg Asp Thr Arg Ser Glu Arg Thr Glu Gly Ser Asp 
620 625 630 

AAT CGC GAA GAA AAC CGT CGT AAT CGT CGC CAG GCA GAG CAG CAG ACT 
2390 

Asn Arg Glu Glu Asn Arg Arg Asn Arg Arg Gin Ala Gin Gin Gin Thr 
635 640 645 650 

GCC GAG ACG CGT GAG AGC CGT CAG CAG GCT GAG GTA ACG GAA AAA GCG 
2438 

Ala Glu Thr Arg Glu Ser Arg Gin Gin Ala Glu-Val Thr Glu Lys Ala 

655 660 665 

CGT ACC GCC GAC GAG CAG CAA GCG CCG CGT CGT GAA CGT AGC CGC CGC 
2486. 

Arg Thr Ala Asp Glu Gin Gin Ala Pro Arg Arg Glu Arg Ser Arg Arg 
670 - 675 ~ 680 



CGT AAT GAT GAT AAA CGT CAG GCG CAA CAA GAA GCG AAG GCG CTG AAT 
2534 

Arg Asn Asp Asp Lys Arg Gin Ala Gin Gin Glu Ala Lys Ala Leu Asn 
685 690 695 

GTT GAA GAG CAA TCT GTT CAG GAA ACC GAA CAG GAA GAA CGT GTA CGT 
2582 

Val Glu Glu Gin Ser Val Gin Glu Thr Glu Gin Glu Glu Arg Val Arg 
700 705 710 

CCG GTT CAG CCG CGT CGT AAA CAG CGT CAG CTC AAT CAG AAA GTG CGT 
2630 

Pro Val Gin Pro Arg Arg Lys Gin Arg Gin Leu Asn Gin Lys Val Arg 
715 720 725 * 730 

TAC GAG CAA AGC GTA GCC GAA GAA GCG GTA GTC GCA CCG GTG GTT GAA 
2678 

Tyr Glu Gin Ser Val Ala Glu Glu Ala Val Val Ala Pro Val Val Glu 

735 740 745 
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GAA ACT GTC GCT GCC GAA CCA ATT GTT CAG GAA GCG CCA GCT CCA CGC 
2726 

Glu Thr Val Ala Ala Glu Pro He Val Gin Glu Ala Pro Ala Pro Arg 
750 755 760 

ACA GAA CTG GTG AAA GTC CCG CTG CCA GTC GTA GCG CAA ACT GCA CCA 
2774 

Thr Glu Leu Val Lys Val Pro Leu Pro Val Val Ala Gin Thr Ala Pro 
765 770 775 

GAA CAG CAA GAA GAG AAC AAT GCT GAT AAC CGT GAC AAC GGT GGC ATG 
2822 

Glu Gin Gin Glu Glu Asn Asn Ala Asp Asn Arg Asp Asn Gly Gly Met 
780 785 790 

CCG CGT CGT TCT CGC CGC TCG CCT CGT CAC CTG CGC GTA AGT GGT CAG 
2870 

Pro Arg Arg Ser Arg Arg Ser Pro Arg His Leu Arg Val Ser Gly Gin 
795 800 805 810 

CGT CGT CGT CGC TAT CGT GAC GAG CGT TAT CCA ACC- CAG TCG CCA ATG 
2918 

Arg Arg Arg Arg Tyr Arg Asp Glu Arg Tyr Pro Thr Gin Ser Pro Met 

815 820 825 

CCG TTG ACC GTA GCG TGC GCG TCT CCG GAA CTG GCC TCT GGC AAA GTC 
2966 

Pro Leu Thr Val Ala Cys Ala Ser Pro Glu Leu Ala Ser Gly Lys Val 
830 835 840 

TGG ATC CGC TAT CCA ATT GTA CGT CCG CAA GAT GTA CAG GTT GAA GAG 
3014 

Trp He Arg Tyr Pro He Val Arg Pro Gin Asp Val' Gin Val Glu Glu 
845 850 855 

CAG CGC GAA CAG GAA GAA GTA CAT GTG CAG CCG ATG GTG ACT GAG GTC 
3062 

Gin Arg Glu Gin Glu Glu Val His Val Gin Pro Met Val Thr Glu Val 
860 865 870 

CCT GTC GCC GCC GCT ATC GAA CCG GTT GTT AGC GCG CCA GTT GTT GAA 
3110 
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Pro Val Ala Ala Ala He Glu Pro Val Val Ser Ala Pro Val Val Glu 
875 880 885 890 

GAA GTG GCC GGT GTC GTA GAA GCC CCC GTT CAG GTT GCC GAA CCG CAA 
3158 

Glu Val Ala Gly Val Val Glu Ala Pro Val Gin Val Ala Glu Pro Gin 

895 900 905 



CCG GAA GTG GTT GAA ACG ACG CAT CCT GAA GTG ATC GCT GCC GCG GTA 
3206 

Pro Glu Val Val Glu Thr Thr His Pro Glu Val He Ala Ala Ala Val 
910 915 920 

ACT GAA CAG CCG CAG GTG ATT ACC GAG TCT GAT GTT GCC GTA GCC CAG 
3254 

Thr Glu Gin Pro Gin Val He Thr Glu Ser Asp Val Ala Val Ala Gin 
925 930 935 

GAA GTT GCA GAA CAA GCA GAA CCG GTG GTT GAA CCG CAG GAA GAG ACG 
3302 

Glu Val Ala Glu Gin Ala Glu Pro Val Val Glu Pro Gin Glu Glu Thr 
940 945 950 

GCA GAT ATT GAA GAA GTT GTC GAA ACT GCT GAG GTT GTA GTT GCT GAA 
3350 

Ala Asp He Glu Glu Val Val Glu Thr Ala Glu Val Val Val Ala Glu 
955 . 960 965 970 

CCT GAA GTT GTT GCT CAA CCT GCC GCG CCA GTA GTC GCT GAA GTC GCA 
3398' 

Pro Glu Val Val Ala Gin Pro Ala Ala Pro Val Val Ala Glu Val Ala 

975 980 985 

GCA GAA GTT GAA ACG GTA GCT GCG GTC GAA CCT GAG GTC ACC GTT GAG 
3446 

Ala Glu Val Glu Thr Val Ala Ala Val Glu Pro Glu Val Thr Val Glu 
990 995 1000 

CAT AAC CAC GCT ACC GCG CCA ATG ACG CGC GCT CCA GCA CCG GAA TAT 
3494 

His Asn His Ala Thr Ala Pro Met Thr Arg Ala Pro Ala Pro Glu Tyr 
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1005 1010 1015 

GTT CCG GAG GCA CCG CGT CAC AGT GAC TGG CAG CGC CCT ACT TTT GCC 
3542 

Val Pro Glu Ala Pro Arg His Ser Asp Trp Gin Arg Pro Thr Phe Ala 
1020 1025 1030 

TTC GAA GGT AAA GGT GCC GCA GGT GGT CAT ACG GCA ACA CAT CAT GCC 
3590 

Phe Glu Gly Lys Gly Ala Ala Gly Gly His Thr Ala Thr His His Ala 
1035 1040 1045 1050 

TCT GCC GCT CCT GCG CGT CCG CAA CCT GTT GAG TAATAATTAG CTCAAAGTAA 
3643 

Ser Ala Ala Pro Ala Arg Pro Gin Pro Val Glu 

1055 1060 



TCAAGCCCTG GTAACTGC 
3661 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1061 amino acids 

(B) TYPE: amino acid 

(D) CONFIGURATION: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Lys Arg Met Leu lie Asn Ala Thr Gin Gin Glu Glu Leu Arg Val 
1 * 5 10 15 



Ala Leu Val Asp Gly Gin Arg Leu Tyr Asp Leu Asp lie Glu Ser Pro 
20 25 " 30 

Gly His Glu Gin Lys Lys Ala Asn lie Tyr Lys Gly Lys lie Thr Arg 
35 40 45 
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lie Glu Pro Ser Leu Glu Ala Ala Phe Val Asp Tyr Gly Ala Glu Arg 
50 . 55 60 

His Gly Phe Leu Pro Leu Lys Glu lie Ala Arg Glu Tyr Phe Pro Ala 
65 70 75 80 

Asn Tyr Ser Ala His Gly Arg Pro Asn lie Lys Asp Val Leu Arg Glu 

85 " 90 95 

Gly Gin Glu Val lie Val Gin lie Asp Lys Glu Glu Arg Gly Asn Lys 
100 105 110 

Gly Ala Ala Leu Thr Thr Phe lie Ser Leu Ala Gly Ser Tyr Leu Val 
115 120 125 

Leu Met Pro Asn Asn Pro Arg Ala Gly Gly lie Ser Arg Arg lie Glu 
130 135 140 

Gly Asp Asp Arg Thr Glu Leu Lys Glu Ala Leu Ala Ser Leu Glu Leu 
145 " ~ 150 155 160 

Pro Glu Gly Met Gly Leu lie Val Arg Thr Ala Gly Val Gly Lys Ser 

165 170 175 

Ala Glu Ala Leu Gin Trp Asp Leu Ser Phe Arg Leu Lys His Trp Glu 
180 185 190 

Ala lie Lys Lys Ala Ala Glu Ser Arg Pro Ala Pro Phe Leu lie His 
195 ~ 200 205 

Gin Glu Ser Asn Val lie Val Arg Ala Phe Arg Asp Tyr Leu Arg Gin 
210 215 22Q 

Asp lie Gly Glu lie Leu lie Asp Asn Pro Lys Val Leu Glu Leu Ala 
225 230 235 240 

Arg Gin His lie Ala Ala Leu Gly Arg Pro Asp Phe Ser Ser Lys lie 

245 250 255 

Lys Leu Tyr Thr Gly Glu lie Pro Leu Phe Ser His Tyr Gin lie Glu 
260 265 ~ 270 
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Ser Gin lie Glu Ser Ala Phe Gin Arg Glu Val Arg Leu Pro Ser Gly 

275 280 285 

Gly Ser lie Val lie Asp Ser Thr Glu Ala Leu Thr Ala lie Asp lie 
290 295 300 

Asn Ser Ala Arg Ala Thr Arg Gly Gly Asp lie Glu Glu Thr Ala Phe 
305 310 315 320 



Asn Thr Asn Leu Glu Ala Ala Asp Glu lie Ala Arg Gin Leu Arg Leu 

325 330 335 

Arg Asp Leu Gly Gly Leu lie Val lie Asp Phe lie Asp Met Thr Pro 
340 345 350 

Val Arg His Gin Arg Ala Val Glu Asn Arg Leu Arg Glu Ala Val Arg 
355 360 365 

Gin Asp Arg Ala Arg lie Gin lie Ser His lie Ser Arg Phe Gly Leu 
370 375 380 

Leu Glu Met Ser Arg Gin Arg Leu Ser Pro Ser Leu Gly Glu Ser Ser 
385 390 395 400 

His His Val Cys Pro Arg Cys Ser Gly Thr Gly Thr Val Arg Asp Asn 

405 410 415 

Glu Ser Leu Ser Leu Ser lie Leu Arg Leu lie Glu Glu Glu Ala Leu 
420 425 430 

Lys Glu Asn Thr Gin Glu Val His Ala lie Val Pro Val Pro lie Ala 
435 440 445 

Ser Tyr Leu Leu Asn Glu Lys Arg Ser Ala Val Asn ' Ala lie Glu Thr 
450 455 460 

Arg Gin Asp Gly Val Arg Cys Val lie Val Pro Asn Asp Gin Met Glu 
465 470 475 480 

Thr Pro His Tyr His Val Leu Arg Val Arg Lys Gly Glu Glu Thr Pro 

485 490 495 

Thr Leu Ser Tyr Met Leu Pro Lys Leu His Glu Glu Ala Met Ala Leu 
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500 

Pro Ser Glu Glu Glu 
515 

Leu Ala Thr Phe Ala 
530 

Glu Pro Ala Ala Pro 
545 

Thr Pro Ala Ala Pro 

565 

Ala Leu Lys Ala Leu 
580 

Gin Pro Ala Pro Lys 
595 

Arg Lys Pro Arg Gin 
610 

Thr Arg Ser Glu Arg 
625 

Arg Asn Arg Arg Gin 

64 5 

Arg Gin Gin Ala Glu 
660 

Gin Ala Pro Arg Arg 
675 

Gin Ala Gin Gin Glu 
690 

Gin Glu Thr Glu Gin 
705 

Lys Gin Arg Gin Leu 

725 



505 

Phe Ala Glu Arg Lys Arg 
52 0 

Met Pro Asp Val Pro Pro 
53 5 

Val Val Ala Pro Ala Pro 
550 555 

Ala Gin Pro Gly Leu Leu 

570 

Phe Ser Gly Gly Glu Glu 
585 

Ala Glu Ala Lys Pro Glu 
600 

Asn Asn Arg Arg Asp Arg 
615 

Thr Glu Gly Ser Asp Asn 
630 635 

Ala Gin Gin Gin Thr Ala 

650 

Val Thr Glu Lys Ala Arg 
665 

Glu Arg Ser Arg Arg Arg 
680 

Ala Lys Ala Leu Asn Val 
695 

Glu Glu Arg Val Arg Pro 
710 715 

Asn Gin Lys Val Arg Tyr 

73 0 
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510 

Pro Glu Gin Pro Ala 
525 

Ala Pro Thr Pro Ala 
540 

Lys Ala Ala Pro Ala 

560 

Ser Arg Phe Phe Gly 
575 

Thr Lys Pro Thr Glu 
590 

Arg Gin Gin Asp Arg 
605 

Asn Glu Arg Arg Asp 
620 

Arg Glu Glu Asn Arg 

640 

Glu Thr. Arg Glu Ser 
655 

Thr Ala Asp Glu Gin 
670 

Asn Asp Asp Lys Arg 
! 685 

Glu Glu Gin Ser Val 
700 

Val Gin Pro Arg Arg 

720 

Glu Gin Ser Val Ala 
735 
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Glu Glu Ala Val 
740 

Pro lie Val Gin 
755 

Pro Leu Pro Val 
770 

Asn Ala Asp Asn 
785 

Ser Pro Arg His 



Val Ala Pro Val 



Glu Ala Pro Ala 
760 

Val Ala Gin Thr 
775 

Arg Asp Asn Gly 
790 

Leu Arg Val Ser 
805 



Val Glu Glu Thr 
745 

Pro Arg Thr Glu 



Ala Pro Glu Gin 
780 

Gly Met Pro Arg 
795 

Gly Gin Arg Arg 
810 



Val Ala Ala Glu 
750 

Leu Val Lys Val 
765 

Gin Glu Glu Asn 



Arg Ser Arg Arg 
800 

Arg Arg Tyr Arg 
815 



Asp Glu Arg Tyr Pro Thr Gin Ser Pro Met Pro Leu Thr Val Ala Cys 
820 825 830 

Ala Ser Pro Glu Leu Ala Ser Gly Lys Val Trp lie Arg Tyr Pro lie 
835 840 845 

Val Arg Pro Gin Asp Val Gin Val Glu Glu Gin Arg Glu Gin Glu Glu 
850 855 860 



Val His Val Gin Pro Met Val Thr Glu Val Pro Val Ala Ala Ala He 

865 870 875 880 

Glu Pro Val Val Ser Ala Pro Val Val Glu Glu Val Ala Gly Val Val 

885 890 895 



Glu Ala Pro Val Gin Val Ala Glu 
900 

Thr His Pro Glu Val He Ala Ala 
915 920 



He Thr Glu Ser Asp Val Ala Val 
930 935 

Glu Pro Val Val Glu Pro Gin Glu 
945 950 



Pro Gin Pro Glu Val Val Glu Thr 
905 910 

Ala Val Thr Glu Gin Pro Gin Val 

925 



Ala Gin Glu Val Ala Glu Gin Ala 
940 

Glu Thr Ala Asp He Glu Glu Val 
955 960 
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Val Glu Thr Ala Glu Val Val Val Ala Glu Pro Glu Val Val Ala Gin 

965 970 975 

Pro Ala Ala Pro Val Val Ala Glu Val Ala Ala Glu Val Glu Thr Val 
980 985 990 

Ala Ala Val Glu Pro Glu Val Thr Val Glu His Asn His Ala Thr Ala 
995 1000 1005 

Pro Met Thr Arg Ala Pro Ala Pro Glu Tyr Val Pro Glu Ala Pro Arg 
1010 1015 1020 

His Ser Asp Trp Gin Arg Pro Thr Phe Ala Phe Glu Gly Lys Gly Ala 
1025 1030 1035 1040 

Ala Gly Gly His Thr Ala Thr His His Ala Ser Ala Ala Pro Ala Arg 

1045 1050 1055 

Pro Gin Pro Val Glu 
1060 



(2) INFORMATION FOR SEQ ID NO : 3 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

. (D) CONFIGURATION: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 

GGGCTGCAGT TTCCGTGTCC ATCCTTG 
27 



(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) CONFIGURATION: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGGAGATCTT GATTACTTTG AGCTAA 
26 



(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) ' STRANDEDNESS: double 

(D) CONFIGURATION: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 

CTGCAGATAG CCCGCCTAAT GAGCGGGCTT TTTTTTCTGC AG 
42 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) CONFIGURATION: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

GCGGTGGTTA AGAAACCAAA C 
21 



